Building a corpus and testing speech recognition

Author

  • D. Bussink
Abstract

Speech recognition remains a rapidly developing area of research with a wide range of applications. This article describes the building of a corpus of sentences collected in human-computer dialogs. The corpus was created in a Wizard-of-Oz environment in which the user performs several navigation-related tasks in a virtual environment. The raw audio collected in these experiments was transcribed and then recognized by a large vocabulary recognizer. The first recognition results indicate that a large vocabulary recognizer with a standard language model and acoustic model gives poor recognition. Notably, recognition performance differs greatly between participants in the experiment. Possible future steps are testing the corpus with adapted acoustic or language models, or using it to evaluate other speech recognition systems.
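
The comparison between participants mentioned in the abstract can be illustrated with a small sketch. It assumes word error rate (WER) as the measure, which the abstract does not name, and the speaker IDs, sentences, and function names below are hypothetical and not taken from the corpus.

```python
from collections import defaultdict


def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer_per_speaker(utterances):
    """Aggregate WER per speaker.

    utterances: iterable of (speaker_id, reference_transcript, recognizer_output).
    Returns a dict mapping speaker_id to errors / reference words.
    """
    errors = defaultdict(int)
    words = defaultdict(int)
    for speaker, ref, hyp in utterances:
        ref_tokens = ref.lower().split()
        hyp_tokens = hyp.lower().split()
        errors[speaker] += edit_distance(ref_tokens, hyp_tokens)
        words[speaker] += len(ref_tokens)
    return {s: errors[s] / words[s] for s in words if words[s] > 0}


# Made-up example data (not from the corpus described above).
sample = [
    ("p01", "turn left at the door", "turn left at the door"),
    ("p01", "go to the kitchen", "go to kitchen"),
    ("p02", "walk forward two steps", "work for two steps"),
]

if __name__ == "__main__":
    for speaker, wer in sorted(wer_per_speaker(sample).items()):
        print(f"{speaker}: WER = {wer:.2f}")
```

Aggregating errors over all of a speaker's reference words, rather than averaging per-utterance rates, keeps short utterances from dominating the per-participant figure.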

Similar resources

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation, which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighboring phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

Isolated Word Recognition using Morph – Knowledge for Telugu Language

Building a speech recognition system for Indian languages is an open question and requires focus. This paper highlights a new model for a speech recognition system that uses the syllable as the basic unit. The model has five phases: the first three focus on training the data and building a Trie structure to reduce time and space, and the last two are for testing. Training includes...

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

Development of Speech corpora for different Speech Recognition tasks in Malayalam language

A speech corpus is the backbone of an automatic speech recognition system. This paper presents the development of speech corpora for different speech recognition tasks in the Malayalam language. A pronunciation dictionary and a transcription file, the other two essential resources for building a speech recognizer, are also being created. Speech recognition performance of different speech recognit...

Publication date: 2006